Astronomy & Astrophysics manuscript no. AA17529 


©ESO 2011 


August 23,2011 





SDSS DR7 superclusters 
Principal component analysis 

M. Einasto 1 , LJ. Liivamag i 1 ' 2 , E. Saar 1 ' 3 , J. Einasto 1 ' 3 ' 4 , E. Tempel 1 , E. Tago 1 , and V.J. Martinez 



o 

(N 

< 
(N 

o 

u 

6 



> 

(N 
m 

o 



X 



Tartu Observatory, 61602 T5ravere, Estonia 

Institute of Physics, Tartu University, Tahe 4, 51010 Tartu, Estonia 
Estonian Academy of Sciences, EE-10130 Tallinn, Estonia 
ICRANet, Piazza della Repubblica 10, 65122 Pescara, Italy 

Observatori Astronomic, Universitat de Valencia, Apartat de Correus 22085, E-46071 Valencia, Spain 



Received ... / Accepted . 



ABSTRACT 



Context. The study of superclusters of galaxies helps us to understand the formation, evolution, and present-day properties of the 
large-scale structure of the Universe. 

Aims. We use data about superclusters drawn from the SDSS DR7 to analyse possible selection effects in the supercluster catalogue, 
to study the physical and morphological properties of superclusters, to find their possible subsets, and to determine scaling relations 
for our superclusters. 

Methods. We apply principal component analysis and Spearman's correlation test to study the properties of superclusters. 
Results. We have found that the parameters of superclusters do not correlate with their distance. The correlations between the physical 
and morphological properties of superclusters are strong. Superclusters can be divided into two populations according to their total 
luminosity: high-luminosity ones with L g > 400 10 10 /?~ 2 Lq, and low-luminosity systems. High-luminosity superclusters form two 
sets, which are more elongated systems with the shape parameter K t /K 2 < 0.5 and less elongated ones with Ki/K 2 > 0.5. The first two 
principal components account for more than 90% of the variance in the supercluster parameters. We use principal component analysis 
to derive scaling relations for superclusters, in which we combine the physical and morphological parameters of superclusters. 
Conclusions. The first two principal components define the fundamental plane, which characterises the physical and morphological 
properties of superclusters. Structure formation simulations for different cosmologies, and more data about the local and high redshift 
superclusters are needed to understand the evolution and the properties of superclusters better. 

Key words, cosmology: observations - cosmology: large-scale structure of the Universe; clusters of galaxies 



1. Introduction 

The large-scale distribution of the dark and baryonic matter in 
the Universe can be described as the cosmic web - the net- 
work of galax ies, groups, and c l usters of galaxies connected 
by filaments jj oeveer et al. 1978; Gregory & Thom pson! 1 1978t 
IZeldovichetalJ 119821: Ide Lapparent et alj 119861) . In this net- 
work superclusters are the largest density enhancements formed 
by the density perturbations on a scale of about 100 h~ l Mpc 
(Hq = lOO/zkms^'Mpc -1 ). Numerical simulations show that 
high-density peaks in the density distribution (the seeds of su- 
percluster cores) are seen already at very early st ages of the for- 
mation and evolution of structure dEinastoll2010h . These are the 
locati ons of the formation of the first objects in t he Universe 
(e.g. JVenemans et alj l2004t iMobasher etall 120051; lOuchi et alj 
2005; iHatch et alj |201 1"). Studying the properties of superclus- 
ters helps us to understand the formation, evoluti on, and proper- 
ties of the large-scale structure of the Universe dHoffman etalj 
120071: 1 Arava-Melo et alj2009al;lBond et al.ll2010l and references 
therein). Comparison of observed and simulated superclusters, 
especially extr eme systems among them, is a test of cosm o- 
logical models ( Kolokot ronis et al]l2002t lEinasto et a"fll2007allel; 
Arava-Melo et alJl2009aHEinasto et al.ll201 lbtlSheth & Diaferid 



The first step in supercluster studies is to compile su- 
percluster catalogues, which serve as observational databases. 
Supercluster catalogues have been constructed using the friend- 
of-friend method or using a smoothed density field of galax- 
ies. The first method has been applied to the data on rich 
(Abell) clusters of galaxies to obtain catalogues of superclus- 
ters of rich cluster s, both from observa ti ons and simulations 
dZucca et alj 119931; lEinasto et al J [19941; iKalinkov & Kuneval 
1995b lEinasto et alJ [19971 120011: IWrav et alJ 120061) . Density 
field superclusters hav e been de t ermin e d using data of deep 
surveys of galaxies dBasilakosI 20031: Einasto et al. 2003a; 
Erdo &du et al]l2004l: lEinasto et al.ll200dl2007btlLiiyamagi et al] 

' '20111). The 
for ex- 



120101; iCosta-Duarte et alJ 1201 U ILuparello etal J 
properties of superclus ters have been studied 
ample, by 



Uaaniste et al.1 Jl998h. iKolokotronis et alJ (120021) 
ICosta-Duarte et alj d2011l) ILuparello et al l d201 ll) . IWrav et all 
d2006l) , and lEinasto et alTd200lL l2007alldleL 120 1 lab . These stud- 
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ies show that the properties of superclusters are correlated. 
More luminous superclusters are richer and larger, contain richer 
galaxy clusters, and have higher maximum densities of galaxies 
than less luminous systems. High-luminosity superclusters are 
more elongated and have more complicated inner structure than 
low-luminosity ones. 

In the present paper we use the Spearman's correlation test 
and the principal component analysis (PCA), an excellent tool 
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for multivariate data analysis, to investigate how strong the cor- 
relations between the properties of superclusters are. Our goals 
are to analyse the presence of possible distance-dependent se- 
lection effects in the supercluster catalogue, to study the cor- 
relations between the physical and morphological properties of 
superclusters, to find the possible subsets and outliers of super- 
clusters, and to determine the scaling relations for the superclus- 
ters. 

Principal component analysis have been used in astron- 
omy for a numb er of purposes: the study of the prop- 
erties of stars dTiit & Einastol 1 19641; iDeemind 1 19641). spec- 
tral classification of galaxies (Sanchez Almeid a et al.l 2010, 
and r eferences therein) , morphological classification of galax- 
ies dCoppa et"al] l2010h studies of galaxies, galaxy groups, 
and dark matter haloes ( Efstathio u & FallllT984c lLanzoni et al] 
2004] iFerreras et al.ll2006t IWoo et all 120081: IChang et al.ll2010t 



Ishida & de S ouzall201 U iToribio et al]|201 ll ISkibba & Macciol 
201 it iJeeson-Daniel et aLll201 ll a nd references therein), fo r the 



Hubble parameter reconstruction (llshida & de Souzall201 ll and 
references therein), and for studies of star formation hist ory in 
the universe using gamma ray bursts (Ishi da et al.ll201 11) . Our 
study is the first in which the PCA is applied to explore the prop- 
erties of superclusters of galaxies. 

In Sect.|2]we give data about superclusters. In Sect.[3]we de- 
scribe the PCA and the Spearman's correlation test, and apply 
them in Sect. [4] to study the physical and morphological prop- 
erties of superclusters and to derive scaling relations for the su- 
perclusters. We discuss selection effects in Sect. [5] and give our 
conclusions in Sect. [6] 

We assume the standard cosmological parameters: the 
Hubble parameter Hq = 100 h km s _1 Mpc -1 , the matter den- 
sity Q. m = 0.27, and the dark energy density Qa = 0.73. 

2. Data 

We selected the MAIN galaxy sample of the 7th data release of 
the Sloan Digital Sky Survey dAdelman- McCarthv et al] 2008: 
lAbazaiian et al .1 120091) with the apparent r magnitudes 12.5 < 
r < 17 '.77, excluding duplicat e entries. The sample is described 
in detail in iTago etalj d2010l) . hereafter T10. We corrected the 
redshifts of galaxies for the motion relative to the CMB and 
computed the co-moving distances (Martinez & Saar 2002) of 
galaxies. 

We calculated the galaxy luminosity density field to recon- 
struct the underlying mass distribution. To determine superclus- 
ters (extended systems of galaxies) in the luminosity density 
field we created a set of density contours by choosing a density 
threshold and define connected volumes above a certain density 
threshold as superclusters. In order to choose proper density lev- 
els to determine individual superclusters, we analysed the den- 
sity field superclusters at a series of density levels. As a result we 
used the density level D = 5.0 (in units of mean density; mean 

luminosity density of our sample is ( meim — 1 .526-10 pj^ ) 
to determine individual superclusters. At this density level su- 
perclusters in the richest chains of superclusters in the volume 
under study still form separate systems; at lower density levels 
they join into huge percolating systems. At higher threshold den- 
sity levels superclusters are smaller and their number decreases. 

In our flux-limited catalogue the luminosity-dependent se- 
lection effects are the smallest at the distance interval 90 h" 1 Mpc 
< Z?com ^ 320 h Mpc. For the present study we chose su- 
perclusters of galaxies in this distance interval. There are 125 
superclusters in the sample. Even the poorest systems in our 



sample contain several groups of galaxies. These systems can 
be compared with the Local supercluster containing one clus- 
ter of galaxies with outgoing filaments. In the Appendix [Al we 
give the details of the calculations of galaxy luminosities and 
of the luminosity density field, as well as of the selection ef- 
fects. The descr i ption of the supercluster catalogues is given in 
iLiivamagi etail OOIOL hereafter L 10). PI 

The superclusters can be characterised by the following 
physical parameters: the total weighted luminosity of galaxies in 
a supercluster, L g , the volume Volume, the diameter Diameter, 
and the number of galaxies in superclusters, A^g a j. The super- 
cluster volume is calculated from the density field as the number 
of connected grid cells multiplied by the cell volume: 



Volume = N sc iA 



(1) 



where A is the grid cell length. 

The total luminosity of the superclusters L g is calculated as 
the sum of weighted galaxy luminosities: 



W L (d ga i)L gal . 

galescl 



(2) 



Here the Wz,(£/ ea i) is the distance-dependent weight of a galaxy 
(the ratio of the expected total luminosity to the luminosity 
within the visibility window). We describe the calculation of 
weights in AppendixlAl The diameter of a supercluster is defined 
as the maximum distance between its galaxies. The distance of 
a supercluster is the distance to it's density maximum. The peak 
density £>p ea i, is that of the highest density peak within the su- 
percluster. Usually the highest values of densities coincide with 
the richest cluster of galaxies in a supercluster. For details we 
refer to L10. 

The overall morphology of a supercluster is described by 
the shapefinders K\ (planarity) and K2 (filamentarity), and their 
ratio, Kx/Ko (the shape parameter). The shapefinders are cal- 
culated using the volume, area, and integrated mean curvature 
of a supercluster; they contain information both about the sizes 
of superclusters and about their outer shape. Systems with dif- 
ferent shapes and si milar sizes have different shape parameters 
(lEinasto et al.l l2008). For the first time th e shapefinders were ap- 
plied in the studies of galaxy systems by Basilak os et al.l ([200 1 ) 
who analysed the shapes of the PSCz superclusters. We use 
the maximum value of the fourth Minkowski functional V3 (the 
dumpiness) to characterise the inner structure of the superclus- 
ters. The larger the value of V3, the more complicated the inner 
morphology of a supercluster is; superclusters m ay be clumpy, 
and they also may have holes or tunnels in them ( lEinasto et al.l 
l2007d,l201 lblh The formulae for the Minkowski functionals and 
shapefinders are given in ApplBl 

The large-scale distribution of superclusters is shown in 
Fig. [D in cartesian co ordinat es. These coord i nates are defined 



as in 



Park et al.] d2007l) and in lLiivamagi et al] d2010): 



x = -dsinA, 
y - d cos A cost], 
z = dcos A sin 77, 



(3) 



where d is the c omoving distance, and A and tj are the SDSS sur- 
vey coordinates. lEinasto et al] (1201 1 ah gave detailed description 
of the large-scale distribution of rich superclusters. 



The supercluster catalogues can be downloaded from: 



|http://at mos . physic .ut . ee/- juhan/super/ 
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Fig. 1. The distribution of superclusters in cartesian coordinates, in units of h 1 Mpc. The filled circles denote superclusters with the 
luminosity L g > 400 10 l0 h~ 2 L@, empty circles denote less luminous superclusters. The numbers are ID's of luminous supercluster 
fromL10(Table|CB- 




1.0 1.0 

N(gal) 



Fig. 2. Distribution of the standardised physical parameters of superclusters. From left to right: the total weighted luminosity of 
galaxies L g , the volume and the diameter of superclusters, the density of the highest density peak inside superclusters, D^^, and 
the number of galaxies in superclusters, A^g a j. 



3. Principal component analysis 

The idea of the principal component analysis is to find a small 
number of linear combinations of correlated parameters to de- 
scribe most of the variation in the dataset with a small number 
of new uncorrelated parameters. The PCA transforms the data 
to a new coordinate system, where the greatest variance by any 
projection of the data lies along the first coordinate (the first prin- 
cipal component), the second greatest variance - along the sec- 
ond coordinate, and so on. There are as many principal compo- 
nents as there are parameters, but typically only the first few are 
needed to explain most of the total variation. 

Principal components PCx (x e N, x < N, ol ) are a linear 
combination of the original parameters: 



'not 



(4) 



i=i 



where - 1 < a(i) x < 1 are the coefficients of the linear transfor- 
mation, Vi are the original parameters and iV tot is the number of 
the original parameters. 

PCA is suitable tool to study simultaneously correlations be- 
tween a large number of parameters, for finding subsets in data, 
and detecting outliers. Linear combinations of principal compo- 
nents can be used to reproduce parameters characterising objects 
in the dataset. 

Principal components can be used to derive scaling relations. 
If data points lie along a plane, defined by the first two principal 
components, then the scaling relations along this plane are de - 
fined by the third principal component (Efstathiou & Fall 1984). 



For the analysis we use standardised parameters, centred on their 
means (V, - V,) and normalised (divided by their standard devi- 
ations, cr( V,)). Therefore we obtain for the scaling relations: 



2 fl(0: 



(Yi - Vd 
criVd 



= 0. 



(5) 



For PCA, the parameters should be normally distributed. 
Therefore we use the logarithms of parameters in most cases; 
this makes the distributions more gaussian, and the range over 
which their values span are smaller, especially for luminosities 
and volumes. We do not use logarithms of morphological data, 
in order to not to exclude from the analysis those with negative 
values of shapefinders, which may occur in the ca se of compact 
superclusters with a complex overall morphology (Einasto et al. 
2008. 1201 lbl) . Figures [2] and [3] show the distribution of the val- 
ues of the standardised parameters. Deviations from the nor- 
mal distribution are mostly caused by the most luminous (or by 
the poorest for the shape parameter) superclusters in our sam- 
ple. In Table Q] we give the mean values and standard devia- 
tions of supercluster parameters. For poor superclusters of "spi- 
der" morphology the shape parameter is not always well defined 
(lEinasto et al.l201 lah . For five systems the value of the shape pa- 
rameter \K\IKz\ > 4; therefore we also calculated the mean value 
and standard deviation of the shape parameter without these sys- 
tems (denoted as KxlK^)- This effect does not affect the values 
of other parameters, thus we did not exclude these systems from 
our calculations. 
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We present in tables the values of principal components and 
the standard deviations, proportion of variance, and cumula- 
tive variance of principal components. The values of compo- 
nents show the importance of the original parameters in each 
PCx. We plot the principal planes for superclusters. For the cal- 
culations we used command prcomp from R, an open-source 
free statistical environme nt developed under the GNU GPL 
(Ihaka & Gentleman ! 19961 http://www.r-project .org). 

To study correlations between properties of superclusters, we 
applied Spearman's rank correlation test, in which the value of 
the correlation coefficient r shows the presence of correlation 
(r = 1 for perfect correlation), anticorrelation (r = — 1 for perfect 
anticorrelation), or the absence of correlations when r*0. 

Table 1. Mean values and standard deviations of supercluster 
parameters. 



(1) 


(2) 


(3) 


Parameter 


mean 


sd 


log(L,) 


2.367 


0.378 


log(V olume) 


2.813 


0.571 


\og(Diameter) 


1.179 


0.258 


iog^peak) 


0.856 


0.119 


log(W gal ) 


2.219 


0.435 


\og(Dist.) 


2.379 


0.113 


V, 


1.770 


1.185 


A', 


0.015 


0.031 


K 2 


0.027 


0.069 


Ki/K 2 


-0.050 


3.701 


kjk; 


0.338 


0.756 



Notes. L g - the total weighted luminosity of galaxies in superclusters 
in units of 10' /i~ 2 Lq; Volume - in units of (fr~'Mpc) 3 ; Diameter - 
in Mpc/7;; A'gaj - the number of galaxies in superclusters; Dp ea ^ - the 
density of the highest density peak inside superclusters, in units of mean 
density; Dist - the distance in Mpc/ft; V3 is the maximum value of the 
fourth Minkowski functional, K t is the planarity, K 2 is the filamentar- 
ity, and the ratio, K\jK 2 , is the shape parameter of superclusters (see 
Section [2] for definitions). Ki/IQ denotes the shape parameter for the 
supercluster sample from which we excluded five most noisy values as 
explained in the text. 




-2.0 0.0 2.0 4.0 -2.0 0.0 2.0 4.0 -2.0 0.0 2.0 4.0 -2.0 0.0 2.0 

V3 K1 K2 K1/K2' 

Fig. 3. Distribution of the standardised morphological parame- 
ters of superclusters. From left to right: the maximum value of 
the fourth Minkowski functional V3, the planarity K\, the fila- 
mentarity K2, and the shape parameter of superclusters, K\jK* T 



4. Results 

4. 1 . PCA with physical parameters of superclusters 

We start the calculations of principal components using physical 
characteristics of superclusters and their distances. Including the 
supercluster distances may show possible correlations between 



the other parameters of superclusters and their distance, which 
will indicate that the parameters of superclusters are affected by 
distance-dependent selection effects. 

Table 2. Results of the principal component analysis, with the 
distances of superclusters included. 



(1) 


(2) 


(3) 


(4) 




PCI 


PC2 


PC3 


log(JV gal ) 


-0.444 


0.264 


-0.108 


log(L e ) 


-0.455 


-0.149 


-0.097 


\og(Diameter) 


-0.441 


-0.133 


-0.542 


log(V 'olume) 


-0.454 


-0.126 


-0.042 


lo g( D peak) 


-0.427 


-0.062 


0.825 


\og(Distance) 


0.100 


-0.932 


0.012 


Importance of component 


s 








PCI 


PC2 


PC3 


Standard deviation 


2.148 


1.046 


0.466 


Proportion of Variance 


0.769 


0.182 


0.036 


Cumulative Proportion 


0.769 


0.951 


0.987 



Notes. Notations given in Section[2] 



Table 3. Results of the Spearman's rank correlation test. 



(1) 


(2) 




(3) 


Parameters 


r 




P 


\og(Dist.) vs. log(Lg) 


-0.06 




0.50 


\og(Dist.) vs. log(iV gal ) 


-0.49 


9.8 


e-9 


log(D(.?f.) vs. \og(Diameter) 


-0.11 




0.20 


\og{Dist.) vs. \og(Volume) 


-0.08 




0.40 


log(Dist.) vs. log(» peak ) 


-0.09 




0.33 


log(D('.?f.) vs. V 3 


-0.03 




0.78 


log(Dist.) vs. A'i 


-0.08 




0.37 


log(Dist.) vs. Ki 


0.04 




0.70 


\og{Dist.) vs. K { /K 2 


-0.05 




0.58 


\og(L g ) vs. logCiVga]) 


0.88 


< 2.2e 


- 16 


log(Lj) vs. \og(Diameter) 


0.95 


< 2.2e 


- 16 


log(L ? ) vs. \og(Volume) 


0.98 


< 2.2e 


- 16 


\og(L s ) vs. log(» peak ) 


0.94 


< 2.2e 


- 16 


\og(L g ) vs. V 3 


0.75 


< 2.2e 


- 16 


log(L g ) vs. Ki 


0.89 


< 2.2e 


- 16 


log(L ? ) vs. K 2 


0.82 


< 2.2<? 


- 16 


log(L g ) vs. Ki/K 2 


0.19 




0.04 



Notes. Rank correlation coefficient r and the p-value p. The values 
p < 0.05 mean that the results are statistically of very high significance. 

Table [2] presents the results of this analysis. We show the 
values of only the first three principal components, enough for 
this test. The coefficients of the first principal component of the 
physical parameters are of almost equal value, while the coeffi- 
cient corresponding to the distance is very small - the first prin- 
cipal component accounts for most of the variance of the physi- 
cal parameters of superclusters. The second principal component 
accounts for most of the variance of the distances of superclus- 
ters. This shows that the physical parameters of superclusters are 
not correlated with distance. To ensure that this interpretation 
is correct we carried out the Spearman's tests for correlations 
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(Table [3]). These tests showed a weak anticorrelation between 
the distance and the number of galaxies in superclusters, with a 
high statistical significance. This is not surprising since the cat- 
alogue of superclusters is based on the flux-limited sample in 
which the number of galaxies in superclusters depends on the 
distance. The sample of superclusters was chosen from a rela- 
tively narrow distance interval, so this dependence is weak. For 
other parameters of superclusters (luminosity, diameter, volume, 
and peak density), the tests showed a very weak correlation with 
distance (Spearman's rank r * 0.1 or less), but with no statistical 
significance, as the p-values show. Therefore we conclude that 
there are no correlations between the distances and physical pa- 
rameters of superclusters, and the distance-dependent selection 
effects have been properly taken into account when generating 
the supercluster catalogue and calculating the physical proper- 
ties of superclusters. 



Table 4. Results of the principal component analysis for the 
physical parameters. 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 




PCI 


PC2 


PC 3 


PC4 


PC5 


log(/V gal ) 


-0.439 


0.056 


0.895 


-0.036 


-0.018 


log(L s ) 


-0.460 


0.112 


-0.217 


-0.047 


0.851 


\og(Diameter) 


-0.445 


0.557 


-0.238 


0.561 


-0.344 


log(V olume) 


-0.458 


0.058 


-0.268 


-0.761 


-0.367 


lo g( D peak) 


-0.430 


-0.818 


-0.149 


0.319 


-0.144 


Importance of components 




PCI 


PC2 


PC 3 


PC4 


PC5 


St. deviation 


2.139 


0.467 


0.377 


0.193 


0.161 


Prop. Variance 


0.915 


0.043 


0.028 


0.007 


0.005 


Cum. Proportion 


0.915 


0.958 


0.987 


0.994 


1.000 



of the tests of other correlations are similar. Especially tight are 
the correlations between the luminosities, the diameters, and the 
volumes of superclusters, as the correlation coefficients show in 
Table [3] 
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0.0 

PC2 

-2.5 
-5.0 

2.5 



-5.0 



_ o CD O 

o° o s o-, .VK-..:->->M"i... - 

o a »° 
o o 
o o 




o o 


1 1 1 1 — 

o ° ° . 

o So °%%<&& • 


-5.0 -2.5 0.0 2.5 
PC3 



-5.0 -2.5 0.0 2.5 
PC1 



Fig. 4. Principal planes for superclusters, PCA with physical pa- 
rameters. Open circles: high-luminosity superclusters with lu- 
minosity L g > 400 1O 1o /i~ 2 L0, grey dots: superclusters of lower 
luminosity. 



Notes. Notations as in Table[2] 



We will proceed with the analysis of superclusters, tak- 
ing only the physical parameters into account. Table [4] which 
presents the results of this analysis, demonstrates that the co- 
efficients of the first principal component are almost equal for 
different parameters of superclusters. Therefore the parameters, 
which describe the full supercluster (the luminosity, richness, 
diameter, and volume), are almost equally important in deter- 
mining the supercluster properties. The cumulative variance in 
Table [4] shows that the first two principal components account 
for more than 95% of the total variance in this supercluster sam- 
ple. The first principal component accounts for most of the vari- 
ance of the overall parameters of superclusters. The values of 
the second principal component show that the largest remaining 
variance in the sample comes from the peak density of super- 
clusters. The values of the third principal component show that 
the coefficients corresponding to the luminosity, volume, and di- 
ameter have almost equal negative values, while the number of 
galaxies has large positive coefficients. 

The PCA therefore suggests that the physical parameters of 
superclusters are strongly correlated. We checked for the pres- 
ence of the correlations between the parameters with Spearman's 
tests, which showed that the correlations between the parameters 
of superclusters are statistically of very high significance, both 
between the overall parameters of superclusters and between the 
overall parameters and the peak density inside the superclusters 
(Table |3j. We only present the correlations between the lumi- 
nosity and other parameters, to keep Table [3] short. The results 



Let us take a look at the locations of superclusters in the prin- 
cipal planes (Fig. [4]). The upper lefthand panel shows the distri- 
bution of superclusters in the principal plane PC1-PC2. Most su- 
perclusters form here an elongated cloud with a very small scat- 
ter. These are low-luminosity superclusters with the luminosity 
L g < 400 10 10 Ii~ 2 Lq. The scatter of positions of high-luminosity 
superclusters is larger. This suggests that we can divide super- 
clusters into two populations according to their total luminosity. 
The transition between populations is smooth. We give the data 
about high-luminosity superclusters in Table IC.ll The luminous 
superclusters with a high value of the peak density have higher 
negative values for the second PC, and the supercluster SCI 001 
has the largest negative value of PC2. The superclusters with 
a lower value of the peak density have positive values of the 
second PC. The richest supercluster in the sample, SCI 061, is 
among them. This supercluster has the highest negative value of 
PCI. The lefthand panels of Figure [4] show that the more lumi- 
nous the supercluster, the higher is the negative value of it's first 
principal component. The value of the peak density inside super- 
clusters determines the location of superclusters along the axis 
of the second principal component. In PC1-PC3 plane (lower 
left panel of Fig. |4| superclusters also form an elongated cloud 
with larger scatter of high-luminosity superclusters. Upper right 
panel (PC3-PC2 plane) shows the third view of this cloud. Such 
an elongated, prolate shape is characteristic of the planar distri- 
bution on PC1-PC2 plane dWoo et al.ll2008l) . which defines the 
fundamental plane for superclusters. 
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Table 5. Results of principal component analysis for the lumi- 
nosity and morphological properties of superclusters. 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 




PCI 


PC2 


PC 3 


PC4 


PC5 


log(L s ) 


-0.489 


0.004 


-0.655 


0.373 


-0.437 


v 3 


-0.490 


-0.044 


0.596 


0.608 


0.173 


K 


-0.511 


-0.023 


-0.297 


-0.331 


0.734 


K 2 


-0.505 


-0.056 


0.351 


-0.615 


-0.488 


K1/K2 


-0.059 


0.997 


0.042 


-0.016 


-0.000 


Importance of components 




PCI 


PC2 


PC 3 


PC4 


PC5 


St.deviation 


1.891 


0.996 


0.528 


0.313 


0.235 


Prop. Variance 


0.715 


0.198 


0.055 


0.019 


0.011 


Cum. Proportion 


0.715 


0.913 


0.969 


0.988 


1.000 



Notes. As in Section|2] 



4.2. PCA for the morphological parameters of superclusters 

Next, we use the PCA to study the morphological and physi- 
cal properties of superclusters simultaneously. From the phys- 
ical characteristics we only include the total luminosity, which 
is sufficient since the physical parameters of superclusters are 
strongly correlated. Table [3] shows that the mophological pa- 
rameters of superclusters are not correlated with their distances. 
Table [5] shows the results of PCA for the luminosity and the 
morphological parameters. Here the absolute values of compo- 
nents for the luminosity, the dumpiness, and the shapefinders 
K\ and K2 are almost equal. Therefore the luminosity and these 
morphological parameters are equally important in shaping the 
properties of superclusters. The second principal component ac- 
counts for most of the variance of the shape parameter K\IKi. 
The higher the negative value of the PCI for the supercluster, 
the more luminous the supercluster, has higher value planari- 
ties and filamentarities, and higher maximal value of the fourth 
Minkowski functional V3, hence a richer inner morphology. 

Table [5] shows that the first two principal components ac- 
count for about 93% of the total variance in the data. 

The Spearman's tests (Table [3} showed that the correlations 
between the supercluster luminosity and its morphological pa- 
rameters are statistically highly significant. The correlation be- 
tween the luminosity and the shape parameter of superclusters is 
weak. 

Figure [5] presents the locations of superclusters in the prin- 
cipal planes, defined by the luminosity and morphological pa- 
rameters of superclusters. The upper lefthand panel shows the 
distribution of superclusters in the principal plane PC1-PC2. 
Here both high- and low-luminosity superclusters form an elon- 
gated cloud with very small scatter. The scatter of positions of 
the high-luminosity superclusters in PC1-PC3 plane is greater. 
Again, the more luminous the supercluster, the higher the nega- 
tive value of its first principal component. High values of PCI 
(and the highest values of PC3) correspond to luminous su- 
perclusters with high values of dumpiness V3 (Table |5}- Large 
scatter along the second principal component PC2 in princi- 
pal planes correspond to superclusters with high values of the 
shape parameter K\ /K2. These are poor superclusters of "spider" 
morphology, for wh ich the shape parameter is not well defined 
(Eina sto et al.ll201 Tah . We see that the luminosity and the mor- 
phological parameters of superclusters also define a fundamental 
plane for superclusters, where the physical and morphological 
properties are combined. 
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Fig. 5. Principal planes for superclusters. PCA with the morpho- 
logical parameters. Open circles: high-luminosity superclusters 
with luminosity L g > 400 10 10 A"" 2 Zq; grey dots: superclusters 
of lower luminosity. 

4.3. Scaling relations for superclusters and the fundamental 
plane 



Table 6. Results of principal component analysis for luminosity, 
diameters, and shapefinders. 



(1) 


(2) 


(3) 


(4) 




PCI 


PC 2 


PC 3 


log(L e ) 


-0.5713 


0.7905 


-0.2205 


KID 


-0.5833 


0.2020 


0.7867 


KID 


-0.5773 


0.5781 


-0.5765 




Importance 


of components 




PCI 


PC2 


PC3 


St.deviation 


1.696 


0.308 


0.165 


Prop. Variance 


0.959 


0.031 


0.009 


Cum.Proportion 


0.959 


0.990 


1.000 



Notes. log(L g ): logarithm of the total luminosity of superclusters, 
KID = (1 - Ki) ■ log(Diameter), and K2D = (1 - K 2 ) ■ log(Diameter). 

The results of the PCA suggest that the first two princi- 
pal components define the fundamental plane for superclus- 
ters. This motivates us to find the scaling relations between 
the supercluster parameters. The scaling relations have ear- 
lier been found between the properties of galaxies, of groups 



of galaxies and of dark matter haloes (jFaber & Jackson 



Tullv & Fisher 1977 



Kormendvll 19771: l Efstathio u & FallL, 
Diorgovski & Davis 1987; Dressier et al. 1987; Schaeffer et al 
1993b lAdami et all 1 19981; lLanzoni et all 120041; Ip'Onofrio et al 
booalWoo et al.ll2008l:lArava-Melo et alJl2009bl and references 
therein). 

For scaling relations we use Eq. (|5]l and perform the PCA 
for the parameters log(L g ), (1 — K\) ■ \og{Diameter) and (1 - 
K2) ■ \og{Diameter). This set combines the easily detectable di- 
ameter of superclusters, and morphological parameters K\ and 
K2, which characterise the sizes and the shapes of superclus- 
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Fig. 6. Principal planes for superclusters. PCA for the luminos- 
ity, diameter, and shapefinders as described in the text. Open cir- 
cles: high-luminosity superclusters with luminosity L g > 400 
10 m h~ 2 LQ, grey dots: superclusters of lower luminosity. 



ters, with the total luminosity of superclusters. For low values of 
shapefinders, (1 -*i ) and (1 - *2) are less noisy than * and *2 
(Einasto etaLll20 11 ah . 
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Fig. 7. Lg(ob S erved) vs. ^(predicted) , in units of 10 lo h 2 L Q . Open 
circles denote high-luminosity superclusters with the luminosity 
L g > 400 10 I0 /r 2 L© and the shape parameter K\ /K 2 > 0.5 (less 
elongated superclusters), filled squares denote high-luminosity 
superclusters with the shape parameter *i /*2 < 0.5 (more elon- 
gated superclusters), grey dots denote superclusters of lower lu- 
minosity. 

Tableland Fig.[6]present the principal components and prin- 
cipal planes of superclusters. Table [6] shows that the first two 
principal components account for 99% of the total variance of 
the parameters. The highest positive values of PC3 in Fig. [6] 
come from high-luminosity, very elongated superclusters. The 
values of PC3 for superclusters SCI 061 and SCI 094 were much 
higher than for other superclusters, therefore we excluded them 



from the calculations as outliers. These are the richest, most lu- 
minous, and most elongated system s with the largest cl umpi- 
ness in our sample (Table ICTI and lEinasto et al.ll20 1 la). The 
supercluster SCI 061 is the richest member of th e Sloan Great 
Wall, an exceptional system in t he nearby universe ( Einast o et al.l 
1201 lbt ISheth & Diafaioll20Tlh . The supercluster SCI 094 (the 
Corona Borealis supercl uster) is the richest system in the dom- 
inant superclust er plane ( Einasto et al.ll2.01 la) . This system has 
b een studied by Small et al. (1998); look also at the references 
in lEinasto et al.l (1201 lal) . 

Equation © and Fig.|7]show the resulting scaling relation. 



log(L„) = (5.11* 2 - 5.87* - 0.76) ■ log(D) + 1.29, 



(6) 



where D denotes diameter. The standard deviation for the rela- 
tion sd = 0.414. Most of the scatter comes from the parame- 
ters of luminous superclusters, for them sd = 0.507, for low- 
luminosity superclusters sd = 0.183. 

In Fig. [7] we denote the high-luminosity superclusters with 
different symbols, according to their shape parameter. Figure [7] 
shows that more elongated and less elongated high-luminosity 
superclusters populate the Lg( bserved)-£g(predicted) plane differ- 
ently. This suggests that luminous superclusters can be divided 
into two populations according to their shapes. Our calculations 
show that there is no such difference for low-luminosity super- 
clusters. The differences between the observed and predicted lu- 
minosity are the largest for five systems with the highest pre- 
dicted luminosity in Fig. [7] These are very elongated luminous 
superclusters in the sample, systems of (multibranching) fila- 
ment morphology, SCI 064, SCI 189, SCI 336, and SCI 474, 
and a multispider SCI 5 30 (for morphologi cal classification of 
superclusters we refer to lEinasto et al.ll201 lah . 

Next we derived the scaling relations separately for more 
elongated and less elongated high-luminosity superclusters (cor- 
respondingly, Eq. (|7]i and Eq. ©), and for all low-luminosity 
superclusters (Eq. : 

log(Lg) = (0.22* 2 - 1 .67tfi + 1 .45) ■ log(D) + 0.69 (7) 

log(L g ) = (3.45*2 - 3.95*! + 0.50) ■ log(D) + 2.09 (8) 

log(L g ) = (63.80*2 - 62.28*! - 1.52) • log(D) + 3.81 (9) 

Figure |8] demonstrates the observed vs. predicted luminos- 
ity of superclusters found with these relations. Now luminosities 
of high-luminosity superclusters are recovered well, with a very 
small scatter (sd = 0.16 and sd = 0.22 for more elongated and 
less elongated superclusters). Interestingly, this figure shows the 
absence of the correlation between the observed and predicted 
luminosity for low-luminosity superclusters. To understand this, 
we plot in Fig.|9]the shapefinders K\ - *2 plane for superclus- 
ters where the size of symbols is proportional to the diame- 
ters of superclusters. Here the values of shapefinders for high- 
luminosity superclusters are correlated, and these superclusters 
also have larger sizes. Most low-luminosity superclusters have 
very low, uncorrelated values of shapefinders (both * and *2 
< 0.025). For the smallest systems they are even negative. An 
examp le of such a system is the Virgo supercluster ( Einasto et al. 
2007e). The results of the PCA show that, while pairwise corre- 
lations between the luminosity and other parameters in Table [3] 
are strong, the correlations between several parameters (diame- 
ters, shapefinders, and luminosities for most of low-luminosity 
superclusters) are almost absent, and so the scaling relation for 
them cannot be derived. Correlation between the observed and 
predicted luminosity for low-luminosity superclusters in Fig. [7] 
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Fig. 8. L^(observed) vs. Ly(predicted), in units of 10 w h Lq. Open 
circles denote high-luminosity superclusters with the luminosity 
L g > 400 10 I0 /r 2 L Q and the shape parameter K\ /K 2 > 0.5 (less 
elongated superclusters), squares denote high-luminosity super- 
clusters with the luminosity L g > 400 10 I0 /i~ 2 Lq and the shape 
parameter K1/K2 < 0.5 (more elongated superclusters), and grey 
dots denote superclusters of lower luminosity. 
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Fig. 9. Shapefinders K\ - K2 plane for superclusters. The size of 
symbols is proportional to the diameters of superclusters. Open 
circles denote high-luminosity superclusters with the luminos- 
ity L g > 400 lO lo /r 2 L and the shape parameter Ki/K 2 > 0.5 
(less elongated superclusters), squares denote high-luminosity 
superclusters with the luminosity L g > 400 10 i0 /!~ 2 Lq and the 
shape parameter K1/K2 < 0.5 (more elongated superclusters), 
grey filled circles denote superclusters of lower luminosity. 



5. Selection effects 

The main selection effect in our study comes from the use of 
a flux-limited sample of galaxies to determine the luminosity 
density field and superclusters. To have luminosity-dependent 
selection effects as small as possible, we used data about galaxies 
and galaxy systems from a distance interval 90 - 320 h~ l Mpc, 
in which these effects are the least (we refer to T10 for details). 
We showed above that the parameters of superclusters (except 
the number of galaxies) do not correlate with distance, which 
shows that the distant-dependent selection effects are correctly 
taken into account when generating the supercluster catalogue. 

If the number of cells used to define superclusters is too 
small then the supercluster catalogue may include objects that 
cannot be considered as real superclusters. Moreover, the de- 
tection of the shape parameter becomes unreliable. If the shape 
parameter is determined using the inertia tensor method then 
superclusters have to be defined using at least eight members 
(iKolokotronis et al.l200ll) . In our study we determine shapefind- 
ers with Minkowski functionals, and the minimum number of 
cells for defining superclusters is 64 (Appendix [A}. We anal- 
ysed systems in a distance interval where the selection effects 
are small. Even the poorest systems contain at least 25 to 30 
galaxies and several groups of galaxies. Therefore the detec- 
tion of the shape parameter may only be affected weakly by 
the selection effects except for the poorest systems of "spider" 
morp hology for which t he sha pefinders may be noisy. We note 
that ICosta-Duarte et al.l (I201 II) include systems with at least ten 
member galaxies in their supercluster catalogue to study of the 
shape parameter of superclusters. 

Another selection effect comes from the choice of the thresh- 
old density to determine superclusters. At the density level used 
in the present paper (D = 5.0), rich superclusters do not perco- 
late yet. If we use a lower threshold density, new galaxies are 
added to superclusters, and some superclusters may join to form 
huge systems. At a higher density level, galaxies in the outskirts 
of the superclusters no longer belong to superclusters, and su- 
perclusters become poorer and smaller. 



Table 7. Results of the principal component analysis for the 
threshold density level D — 5.5. 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 




PCI 


PC2 


PC3 


PC4 


PC5 


logA'ga! 


-0.437 


0.085 


0.889 


-0.082 


0.052 


logLg 


-0.460 


0.093 


-0.146 


0.854 


-0.166 


logDiameter 


-0.447 


0.523 


-0.282 


-0.443 


-0.498 


log Volume 


-0.461 


0.094 


-0.298 


-0.150 


0.816 


\ogD peak 


-0.428 


-0.837 


-0.136 


-0.208 


-0.233 


Importance of components 




PCI 


PC2 


PC3 


PC4 


PC5 


St.deviation 


2.127 


0.484 


0.406 


0.214 


0.172 


Prop. Variance 


0.904 


0.046 


0.033 


0.009 


0.005 


Cum.Proportion 


0.904 


0.951 


0.984 


0.994 


1.000 



Notes. Notations as in Table [2] 



comes from the high-luminosity superclusters and from the low- 
luminosity superclusters with the shape parameters K\ and K2 
> 0.025. 



To see the sensitivity of the PCA results to the small differ- 
ences in the choice of the threshold density, we compared the 
results of the PCA for superclusters chosen at higher and lower 
threshold density levels. As an example we show in Table [7] 
the coefficients of the principal components for the superclus- 
ters chosen at the threshold density level D = 5.5. At this den- 
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Table 8. Results of the Spearman's rank correlation test for the 
threshold density level D = 5.5. 



(1) 


(2) 




(3) 


Parameters 


r 




P 


log(L^) vs. logCNggj) 


0.85 


< l.le 


- 16 


log(L g ) vs. \og(Diameter) 


0.94 


< l.le 


- 16 


log(L ? ) vs. \og{Volume) 


0.98 


< l.le 


- 16 


log(L s ) vs. log(£» peak ) 


0.94 


< l.le 


- 16 



Notes. Rank correlation coefficient r and the p-value p. 



sity level, Lup arello et alJ d201 ll) determined superclusters in the 
SDSS-DR7 for volume-limited samples of galaxies. We used 
flux-limited samples, thus the density levels cannot be compared 
directly, but we can still choose this level for the present test. 
Table [8] shows the results of the Spearman's correlation test for 
this density level. The comparison with Tables [4] and [3] shows 
that the coefficients are almost the same. Therefore the results 
of the correlation test and the PCA are not very sensitive to the 
choise of the density level. 

6. Discussion and conclusions 

We studied the properties of superclusters drawn from the SDSS 
DR7 using the principal component analysis and Spearman's 
correlation test. Several earlier studies have shown that the 
properties of superclusters are correlated (see the references in 
Sect. [TJ. However, it is surprising that the correlations between 
the various properties of superclusters are so tight. The first two 
principal components account for most of the variance in the 
data. Different physical parameters (the luminosity, volume, and 
diameter) and the morphological parameters (the dumpiness and 
the shape parameters) are almost equally important in shaping 
the properties of superclusters. This suggests that superclusters, 
as described by their overall physical and morphological prop- 
erties and by their inner morphology and peak density, are ob- 
jects that can be described with a few parameters. We derived 
the scaling relation for superclusters in which we combine their 
luminosities, diameters, and shapefinders. 

We saw in Fig. [7] that more elongated and less elon- 
gated high-luminosity superclusters populate the Lg^bsemed)- 
Lg(predicted) plane differently. This suggests that luminous su- 
perclusters can be divided into two populations according to 
their shapes - more elongated systems with the shape param- 
eter K1/K2 < 0.5 an d less elongated ones with K1/K2 > 0.5. 
Einas to et al.l d201 lal) got a similar result using multidimensional 
normal mixture modelling. It is remarkable that two different 
multivariate methods reveal information about the data in such 
good agreement. However, there are few high-luminosity super- 
clusters in our sample. There are 14 systems with the shape 
parameter K1/K2 < 0.5 among them, and 17 systems with 
K1/K2 > 0.5. A larger sample of superclusters has to be anal- 
ysed to confirm this result. 

Parameters used to characterise superclusters in the present 
study do not reflect all the properties of superclusters. 
For example, rich superclusters contain high-density cores 
that ma y contain merging X-ray clusters and may be col- 
lapsing (ISmall et al] 1 19981 iBardelli et al.l 120001: lEinasto et alJ 



fects the prop erties of ; 
cated ther e ( Einast o et al. 
2005t [ Haines et all l2006t 
2008; Tempel et all 120091 



alaxies, 



grgujjs, 
Plionis 



and 



clusters 

1 IT 



lo- 



2003b; Plionis 2004; Wolfetal 
Einasto et al. 2007d; Porte r et al 
Fleenor & Johnston-Hollitt 2010: 
Tempel et al.11201 It lEinasto et al.ll2.01 lbl) . lEinasto et al.l d201 lbt 



showed that the dynamical evolution of one of the richest su- 
perclusters in the Sloan Great Wall (SCL 111, SCI 024 in L10 
catalogue) is almost finished, while the richest member of the 
Wall, SCI 126 (SCI 061) is still dynamically active. Therefore 
our results reflect only certain aspects of the properties of super- 
clusters. 



studied b 1 


/ a number of authors ( 


Pandev & Bharadwai 


2005; 


Gott et al. 


20051 Park et all 2005: 


Pandev & Bharadwai 


2006; 


Gottet al. 


2008; Pandev & Bharadwai 2008; Kitaura et a 


2009; 


Choi et al 


1 120101 ISousbie et all 1201 it lEinasto et al. 201 lblla : 



2011; 



[20011 iRose et all l2002t lEinasto et al.l l2007d I2008I) 7a 



percluster environment with a wide range of densities af- 



Shet h & Diaferiol 1201 It IPimbblet et all 
l201ll) . The overall shapes of superclusters have been de- 
scribed by the shape param e ters o r approximated by tri- 
axial ellipses {ja aniste etalJ Il998t iB asilakos et al. 2001; 
i Kolokptronis et al.ll2002t lBasilakosll2003t lEinasto et al]l2007ai 
1201 lbllatlCosta-Duarte et al.ll201 ltlLuparello et al.ll201 ll) . These 
studies showed that elongated, prolate structures dominate 
among supercluster s. The results obtained using the mom ents 
of inertia tensor (Basilakos et alJ 1200 it IBasilakos] 120031) or 
the Minkowski functionals are in a go od agreement (see also 
lEinasto et afll2007dl2THlal) . In addition, Basila kos et al] ([2006) 
analysed correlations between supercluster properties from sim- 
ulations and find that the amplitude of the supercluster - cluster 
alignment increases (weakly) with superclusters filamentarity. 

The properties of s uperclusters are de t ermin ed by their for- 
mation and evolution. Kolok otronis et al.l ([2002) show that the 
shapes of superclusters agre e better with a ACDM model than 
with a rCDM model. Also iLuparello et alJ (1201 ll) found that 
the shapes of observed superclusters agree with those in the 
ACDM model. In the ACDM concordance cosmological model, 
the matter density Q m dominated in the early universe and 
the structures formed by hierarhical clustering driven by grav- 
ity. As the universe expands, the average matter density de- 
creases. At a certain epoch, the dark energy density £2a became 
higher than the matter density, and the universe started to ex- 
pand acceleratingly. Simulations of the evolution and the fu- 
ture of the structure in an accelerating universe show the freez- 
ing of the web - the large-scale evolution of structures slows 
down dLoey2002tlNagamine & Loebll2003t iDunner et alJl2006t 
iHoffman et alJ l2007t iRr auss & Scherreg l2007t and references 
therein - ) . lArava-Melo et al.l d2009al) show that this affects the 
sizes, the shapes, and the inner structure of superclusters, and 
they become rounder, smaller, and their multiplicity decreases. 
According to our present results, this suggests that in the future 
superclusters become less elongated and the scatter in the scal- 
ing relation of superclusters may decrease. 

Summarising, our study showed that 

1) The PCA and Spearman's correlation test showed the ab- 
sence of correlations between the physical properties of 
superclusters and their distance, therefore the distance- 
dependent selection effects were taken into account properly 
when generating supercluster catalogues. 

2) The correlations between the properties of superclusters are 
tight. Different physical parameters (the luminosity, the vol- 
ume, and the diameter) and the morphological parameters 
(the dumpiness and the shapefinders) of superclusters are 
equally important in shaping the properties of superclusters. 
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3) The first two principal components account for more than 
90% of the variance of the supercluster properties and define 
the fundamental plane of superclusters. This suggests that 
superclusters can be described with a few physical and mor- 
phological parameters. We derived the scaling relation for 
superclusters using data about their luminosities, diameters, 
and shapefinders. 

4) Superclusters can be divided into two populations accord- 
ing to their luminosity, using the luminosity li mit Lg = 400 
lO w h~ 2 LQ. In agreement with lEinasto et al.l d201 lal) . we 
find that high-luminosity superclusters can be divided into 
two sets: more elongated systems with the shape parameter 
K\IKi < 0.5 and less elongated ones with K1/K2 > 0.5. 

For our study we chose a small sample of superclusters least 
affected by selection effects. To understand the properties of su- 
perclusters better the next step is to study a large sample of su- 
perclusters and high-redshift superclusters. A few superclusters 
at ver y high redshifts h ave al ready been discovered ( Nakat a^etaD 
i05t ISwinbank et al.ll2007[ iGal et al.ll2008t iTanaka et al.l l2009: 
201 ll ISchirmer et all 1201 ll). Deep 
20081) will 



2005; 

Planck Collaboration et al.1 

surveys like the ALHAMBRA project dMoles et al.l 
provide us with data about (possible) very distant superclusters. 
We also need more simulations with various cosmologies to un- 
derstand the evolution and the properties of superclusters in de- 
tail. 
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Appendix A: Luminosity density field and 
superclusters 

To calculate the luminosity density field, we calculate the lumi- 
nosities of groups first. In flux-limited samples, galaxies outside 
the observational window remain unobserved. To take into ac- 
count the luminosities of the galaxies that lie outside the sample 
limits also we multiply the observed galaxy luminosities by the 
weight Wd- The distance-dependent weight factor Wj was calcu- 
lated as 



W d 



f Q °°Ln(L)dL 
f Ll L n{L)AL ' 



(A.l) 



where L\ 2 = L0lO°' 4(M O ~ M| - 2) are the luminosity limits of the 
observational window at a distance d, corresponding to the ab- 
solute magnitude limits of the window Mi and M2; we took 



Mq = 4.64 mag in the r-band dBlanton & Roweisll2007h . Due to 
their peculiar velocities, the distances of galaxies are somewhat 
uncertain; if the galaxy belongs to a group, we use the group 
distance to determine the weight factor. 




Distance (ti Mpc) 



Fig. A.l. Weights used to correct for probable group members 
outside the observational luminosity window. 



The luminosity weights for the groups of the SDSS DR7 in 
the distance interval 90 ft -1 Mpc> D < 320 h~ x Mpc are plotted 
as a function of the distance from the observer in Fig. IA.ll The 
mean weight is slightly higher than unity (about 1 .4) within the 
sample limits. When the distance is greater, the weights increase 
owing to the absence of faint galaxies. Detai l s of th e calculations 
of weights are given also in Tempel et alJ d201 ll) . In the final 
flux-limited group catalogue, the richness of groups decreases 
rapidly at distances D > 320 h~ l Mpc due to selection effects 
(Tago et al] |2010b lEinasto et al.ll2.01 lab . This is another reason 
to choose for our study superclusters from the distance interval 
90 h~ x Mpc < D < 320 h~ x Mpc where the selection effects are 
weak. Even the poorest systems in our sample contain several 
groups of galaxies being real galaxy systems comparable to the 
Local supercluster. 

To calculate a luminosity density field, we convert the spa- 
tial positions of galaxies r, and their lumino sities L, into spat ial 
(luminosity) densities using kernel densities (I S il vermanl fl 9 86): 



P {T) = Y J K(r-v i ;a)L i , 



(A.2) 



where the sum is over all galaxies, and K (r; a) is a kernel func- 
tion of a width a. Good kernels for calculating densities on a 
spatial grid are generated by box splines B j. Box splines are lo- 
cal and they are interpolating on a grid: 



y,B J (*-o = i, 



(A.3) 



for any x and a small number of indices that give non-zero values 
for B j(x). We use the popular B 3 spline function: 



B 3 (x) = (\x-2\ 3 -4\x- 1| 3 + 6|jc| 3 - 
-4|x+l| 3 + |x + 2| 3 )/12. 



(A.4) 



The (one-dimensional) B3 box spline kernel K^p of the width a 
is defined as 

K B l \x;a,5) = B 3 (x/a)(S/a), 



(A.5) 



where 6 is the grid step. This kernel differs from zero only in the 
interval x e [-2a, 2a]. It is close to a Gaussian with cr = 0.6 in 
the re gion x e [-a, a], so its effective width is 2a (see, e.g. JSaarl 
2009). The kernel preserves the interpolation property exactly 
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for all values of a and 6, where the ratio a/6 is an integer. (This 
kernel can be used also if this ratio is not an integer, and a » 5; 
the kernel sums to 1 in this case, too, with a very small error.) 
This means that if we apply this kernel to N points on a one- 
dimensional grid, the sum of the densities over the grid is exactly 
N. 

(3) 

The three-dimensional kernel K B is given by the direct 
product of three one-dimensional kernels: 



K ( ^(r; a, 6) = K^\x; a, 6)K^\y; a, 6)K ( J\z\ a, 8), 



(A.6) 



(A.7) 



where r = {x,y, z}- Alth ough this is a direct product, it is 
isotro pic to a good degree dSaarfl2009l) . 

In lEinasto et al.l (l2007el) we compared the Epanechnikov, the 
Gaussian, and B 3 box spline kernels for calculating the density 
field. The Epanechnikov and the B 3 kernels are both compact, 
while the Gaussian kernel is infinite and has to be cut off at a 
fixed radius, which introduces an extra parameter. We also found 
that both the Epanechnikov and the B 3 kernels describe the over- 
all shape of superclusters well, while the B3 box spline kernel 
resolves the inner structure of superclusters better. This is why 
we used this kernel in the present study. 

The densities were calculated on a cartesian grid based on 
the SDSS 77, A coordinate system, as it allowed the most efficient 
fit of the galaxy sample cone into a brick. Using the rms veloc- 
ity cr v , translated into distance, and the rms projected radius <x r 
from the group catalogue (T10), we suppress the cluster finger 
redshift distortions. We divide the radial distances between the 
group galaxies and the group centre by the ratio of the rms sizes 
of the group finger: 

"gal,f — ^group "t" 0^gal,i — ^group) / 0~v • 

This removes the smudging effect the fingers have on the density 
field. 

The grid coordinates are calculated according to Eqf3] We 
used an 1 hr l Mpc step grid and chose the kernel width a = 
8 hT l Mpc. This kernel differs from zero within the radius 
16 h~ l Mpc, but significantly so only inside the 8 h~ l Mpc ra- 
dius. As a lower limit for the volume of superclusters we used 
the value (a/2) h~ l Mpc 3 (64 grid cells). In this way we ex- 
clude small spurious density field objects which include almost 
no galaxies. lLiivamagi et al.l (12010) tested the method generating 
the superclusters from the Millenium simulations. This compari- 
son showed that supercluster algorithms work well, and, in addi- 
tion, the selection effects have been properly taken into account 
when generating a supercluster catalogue from flux-limited sam- 
ple of galaxies. 

Before extracting superclusters we apply the DR7 
mask constructed by P. Arnalte-Mur (Marti nez et al.l [2009; 
iLiivamagi et al.ll2010l) to the density field and convert densities 
into units of mean density. The mean density is defined as 
the average over all pixel values inside the mask. The mask 
is designed to follow the edges of the survey and the galaxy 
distribution inside the mask is assumed to be homogeneous. 



integr ated mean curvature C, and the integrated Gaussian curva- 
ture y (ISahni et al.ll998l:lMartmez & Saarl2002Ushandarin et al.l 
l2004HSaar et al.ll2007HSaaril2009h . 

With the first three Minkowski functionals, we calculate 
the dimens ionless shapefind e rs K\ (planarity) and K 2 (fila- 
mentaritv) (ISahni et all [l 9981: IShandarin etafl 120041) . See also 



Basilakos et al. (2001), in this study the shapefinders were deter- 
mined with the moments of inertia method. First we calculate the 
shapefinders H\-H 3 with a combination of Minkowski function- 
als: H x = 3V/S (thickness), H 2 = S jC (width), and H 3 = CI An 
(length). Then we use the shapefinders H\-H 3 to calculate two 
dimensionless shapefinders K\ (planarity) and K 2 (filamentar- 
ity): #i = (# 2 -ffi)/(#2+#i) and K 2 = (H 3 -H 2 )/(H 3 +H2). We 
characterise the overall shape of superclusters using planarity K\ 
and filamentarity K 2 , and their ratio, K\/K 2 (the shape parame- 
ter). 

The fourth Minkowski functional V 3 , describes the topol- 
ogy of the surface and gives the number of isolated clumps, 
the number of void bubbles, and the number of tunnels (void s 
open from both sides) in the region (see, e.g. ISaar et al.ll2.Q07b . 
Morphologically the superclusters with low values of the fourth 
Minkowski functional V3 can be described as simple spiders or 
simple filaments. High values of the fourth Minkowski func- 
tional V3 suggest a complicated (clumpy) morphology of a su- 
percluster, described as mult ispiders or multibranching filaments 
(lEinasto et al]|2007eil20TTah . 

Appendix C: Data on luminous (L g > 400 10 1() /z 2 L Q ) 
superclusters 



Appendix B: Minkowski functionals and 
shapefinders 

The supercluster morphology is fully characterised by the four 
Minkowski functionals V0-V3. For a given surface the four 
Minkowski functionals (from the first to the fourth) are propor- 
tional to the enclosed volume V, the area of the surface S , the 
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Table C.l. Data on luminous (L g > 400 10 10 /z 2 L©) superclusters 



(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


(12) 


(13) 


ID 


ID 


Distance 


L g 


^gal 


Volume 


Diameter 


^peak 


v 3 


#i 


K 2 


Ki/K 2 


ID EOl 






Mpc//z 


10 10 r 2 L o 




(/i-'Mpc) 3 


Mpc//i 












1 


239+027+009 


264 


1591.5 


1038 


8435 


50 


21.6 


2 


0.080 


0.152 


0.527 


162 


10 


239+016+003 


111 


680.2 


1463 


3378 


22 


16.4 


1 


0.038 


0.015 


2.456 


160 


11 


227+006+007 


233 


1476.0 


1222 


8065 


35 


16.7 


4 


0.053 


0.049 


1.081 


154 


24 


184+003+007 


230 


1768.2 


1469 


10040 


56 


14.1 


5 


0.089 


0.145 


0.616 


111 


38 


167+040+007 


224 


660.7 


586 


3243 


22 


13.8 


2 


0.023 


0.040 


0.593 


95 


55 


173+014+008 


242 


1773.0 


1306 


9684 


50 


12.3 


5 


0.091 


0.179 


0.509 


111 


60 


247+040+002 


92 


527.4 


1335 


2472 


21 


12.0 


2 


0.013 


0.021 


0.645 


160 


61 


202-001+008 


255 


4315.3 


3056 


23475 


106 


12.9 


13 


0.126 


0.459 


0.274 


126 


64 


250+027+010 


301 


1305.4 


619 


6058 


55 


12.6 


4 


0.091 


0.229 


0.399 


164 


87 


215+048+007 


213 


477.8 


445 


2301 


21 


11.0 


2 


0.039 


0.026 


1.494 




94 


230+027+006 


215 


2263.4 


1830 


11256 


54 


11.1 


8 


0.113 


0.399 


0.284 


158 


129 


170+053+010 


309 


526.7 


223 


2321 


20 


10.6 


3 


0.029 


0.048 


0.612 




136 


189+017+007 


212 


523.2 


504 


2590 


20 


10.9 


2 


0.027 


0.030 


0.925 


271 


152 


230+005+010 


301 


907.5 


423 


4756 


32 


10.8 


3 


0.057 


0.097 


0.585 


160 


189 


126+017+009 


267 


771.0 


433 


3063 


43 


9.5 


4 


0.070 


0.190 


0.372 




195 


134+038+009 


280 


487.9 


273 


2200 


23 


9.9 


2 


0.031 


0.031 


1.004 




198 


152-000+009 


284 


863.9 


473 


4448 


38 


9.7 


4 


0.050 


0.103 


0.490 


82 


223 


187+008+008 


268 


703.7 


462 


3368 


33 


9.3 


3 


0.051 


0.142 


0.361 


111 


228 


203+059+007 


210 


644.0 


643 


3361 


31 


9.5 


2 


0.040 


0.040 


0.992 


133 


327 


170+000+010 


302 


419.8 


205 


1747 


20 


8.5 


2 


0.016 


0.071 


0.228 




332 


175+005+009 


291 


664.3 


333 


3128 


27 


8.2 


3 


0.062 


0.078 


0.788 


106 


336 


172+054+007 


207 


1003.6 


1005 


4605 


53 


8.7 


5 


0.082 


0.246 


0.332 


109 


349 


207+026+006 


188 


768.8 


893 


3942 


42 


8.8 


4 


0.064 


0.105 


0.610 


138 


350 


230+008+003 


105 


436.3 


955 


1987 


22 


8.0 


2 


0.022 


0.059 


0.383 


160 


351 


207+028+007 


225 


689.1 


615 


3292 


32 


8.7 


4 


0.056 


0.086 


0.647 


138 


366 


217+020+010 


300 


763.4 


353 


3681 


31 


8.1 


4 


0.064 


0.156 


0.409 


158 


376 


255+033+008 


258 


658.0 


437 


3097 


27 


8.6 


4 


0.050 


0.041 


1.228 


167 


474 


133+029+008 


251 


612.6 


389 


2299 


43 


7.6 


4 


0.068 


0.223 


0.307 


76 


512 


168+002+007 


227 


410.7 


371 


1658 


26 


7.5 


3 


0.040 


0.082 


0.490 


91 


530 


192+062+010 


306 


790.3 


333 


3690 


40 


7.5 


4 


0.084 


0.207 


0.409 




827 


189+003+008 


254 


572.4 


405 


2238 


30 


6.7 


4 


0.052 


0.116 


0.450 





Notes. Columns are as follows: 1: ID in L10 catalogue; 2: supercluster ID (AAA+BBB +ZZZ, AAA - R.A., +/-BBB - Dec, CCC - lOOz); 3: the 
distance of the supercluster; 4: the total weighted luminosity of galaxies in the supercluster, Lg; 5: the number of galaxies in a supercluster, A^i; 
6: the volume of the supercluster, Volume; 7: the supercluster diameter, Diameter (the maximum distance between galaxies in the supercluster); 8: 
the peak density Op ea j, of the supercluster, in units of mean density; 9: the maximum value of the fourth Minkowski functional, V3 (dumpiness), 
for the supercluster; 10 - 12: shapefinders K\ (pl anarity) and K 2 (filam entarity), and the ratio of the shapefinders K\jK 2 of the full supercluster. 
13: ID EQl : the supercluster ID in the catalogue by Einasto et al. (2001). SCI 160 - the Hercules supercluster, SCI 111 and SCI 126 - members of 
the Sloan Great Wall, SCI 158 - the Corona Borealis supercluster, SCI 138 - the Bootes supercluster, SCI 336 - the Ursa Majoris supercluster. 
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